Second-order Statistics Are Less Important for Audio Textures than Image Textures
نویسندگان
چکیده
This letter proposes to adapt the techniques of image texture analysis to the context of audio signals. Dynamics of audio frame-level features are particularly difficult to model, although they have been identified as crucial perceptive dimensions of timbre perception. Recent studies have given ample evidence that traditional means to model data dynamics, such as delta-coefficients, texture windows or Markov modelling, do not provide any improvement over the best static models. The situation is completely different for image texture analysis, where second-order statistical analysis, based on co-occurrence matrices have proven to be an efficient measure for texture similarity. This paper reports on experiments to adapt these image analysis techniques to audio signals. Results show that co-occurrence analysis of sound textures based on vector-quantized MFCC features do not provide any advantage over first-order histograms in the building of content-based similarity measures. This suggests that second-order statistics, in the form considered in this study, are not a factor as crucial for the perception of sound textures as it is for image textures. EDICS Category: AEA-AUEA,IMD-ANAL
منابع مشابه
Second-order Statistics Are Less Important for Audio Textures than for Image Textures
This paper proposes to adapt the techniques of image texture analysis to the context of audio signals. Dynamics of audio frame-level features are particularly difficult to model, although they have been identified as crucial perceptive dimensions of timbre perception. Recent studies have given ample evidence that traditional means to model data dynamics, such as delta-coefficients, texture wind...
متن کاملAdapting Image Texture Co-occurrence Analysis for Audio Texture Similarity
In this letter, we adapt a well-known technique of image texture analysis (grey-level co-occurrence matrix) to compute similarity between musical audio signals. Grey-level cooccurrence matrices estimate the joint probability of pairs of pixel values separated by a spatial displacement vector. Instead of using pixel grey-level values, we propose to use frame-based audio features, obtained either...
متن کاملThe role of higher order image statistics in masking scene gist recognition.
In the present article, we investigated whether higher order image statistics, which are known to be carried by the Fourier phase spectrum, are sufficient to affect scene gist recognition. In Experiment 1, we compared the scene gist masking strength of four masking image types that varied in their degrees of second- and higher order relationships: normal scene images, scene textures, phase-rand...
متن کاملA Generative Model of Natural Texture Surrogates
Natural images can be seen as patchworks of different textures for which local image statistics are roughly stationary within confined regions but otherwise can be highly diverse. In order to model the natural variety of textures, we sampled 64×64 patches of homogeneous textures from a large image database and described each patch by a set of texture parameters obtained with a popular texture a...
متن کاملBeyond fourth-order texture discrimination: generation of extreme-order and statistically-balanced textures
Julesz introduced the concept of statistically defined textures and their perceptual discrimination. Julesz showed that discrimination was possible with statistics equated to third-order, specifying fourth-order textures. Klein and Tyler offered a variety of paradigms suggesting that fourth order might be the limit on human texture processing. To go beyond this limit, new texture paradigms are ...
متن کامل